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There are several categories to detect and measure driver drowsiness such as 
physiological methods, subjective methods and behavioral methods. The 
most objective method for drowsiness detection is the physiological method. 
One of the physiological methods used is an electroencephalogram (EEG). In 
this research wavelet transform is used as a feature extraction and using 
support vector machine (SVM) as a classifier. We proposed an experiment of 
retrieval data which is designed by using modified-EAR and EEG signal. 
From the SVM training process, with the 5-fold cross validation, Quadratic 
kernel has the highest accuracy 84.5% then others. In testing Driving-2 
process 7 respondents were detected as drowsiness class, and 3 respondents 
were detected as awake class. In the testing of Driving-3 process, 6 
respondents were detected as drowsiness class, and 4 respondents were 
detected as awake class. 
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1. INTRODUCTION 

The everyday activities of human life require transportation. Transportation is used to travel from 
one place to another. One of the transportation methods is done by car. Driving a car is an activity that 
requires high level of concentration, because driving for a long time can cause fatigue. Fatigue causes 
drowsiness which results in a decreased level of concentration in the driver which can result in a traffic 
accident. One effort that can be done to prevent traffic accidents due to drowsiness on the driver is by 
detecting drowsiness before the accident occurs. 

There are several categories to detect and measure driver drowsiness such as physiological methods, 
subjective methods and behavioral methods [1]. There are several subjective methods, testing the respondent 
based on their subjective assestment such as karolinska sleepiness scale (KSS) and stanford sleepiness scale 
(SSS). Brown et al. [2] used KSS to observe the scale of sleepiness in nurses World Health Organization 
worked after shifts or when shifts were carried out, and the result was that nurses were detected to feel more 
drowsy when there was a shift in compared to after a shift. Jewett et al. [3], in the study of drowsiness 
detection with the SSS and psychomotor vigilance task (PVT), information is obtained by observing the 
slow- motion response in humans when drowsy with varying sleep times. In a drowsy condition, human 
response is slower than the awake condition. PVT has better performance than SSS performance. Some 
evidence used in the PVT test saw that a person who is drowsy usually has decreased eye pupil and slower 
motoric reaction speed. The PVT test observes the reaction of the respondent onlooking number on display. 
In our research [4] observed the performance of female and male drivers in PVT-based driving. Performance 
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test from female and male driver have the same fatigue result, that female driver’ s collisions are higher than 
male driver’ s collisions. 

Drowsiness detection and measurement using behavioral methods, focus in facial expression such 
as: head and eye position/ state and frequent yawning [1]. Measure drowsiness using eye aspect ratio (EAR) 
is one of the behaviour methods. In the research by Mehta [5], a mild drowsiness detection system using 
Android was developed that can be used in real-time. The method used is image processing on recorded 
video frames, to detect faces using landmarks, then calculating the EAR and eye closure ratio (ECR) to detect 
driver sleepiness based on adaptive thresholding with a threshold value of 0.25. If the EAR value is less than 
the threshold value, it indicates a state of fatigue. The random forest classifier is used as a classifier with an 
accuracy of 84%. In our research proposed a modification of EAR, by using EAR threshold that calculating 
from EARmin dan EARmax design for individually driver [6]. 

The most objective method for drowsiness detection is the physiological method [1]. One of the 
physiological methods used is an electroencephalogram (EEG). EEG is done by recording human brain 
signals. There are several research analysing EEG signals for driver's drowsiness detection [7]-[16]. EEG 
signals can be extracted with a variety of methods such as wavelet transforms [17]-[18], fast fourier 
transform (FFT) [18], dan autoregressive model (AR) [19]. 

In the research by Li and Chung [18], design and development drowsiness detection system and 
combines with support vector machines (SVM). Hardware design using photoplethysmography (PPG) sensor 
and heart rate variability (HRV), verified based on the percentage of eyelid closure (PERCLOS) pupil. PPG 
signal and then classify driver as drowsy or alert. From this research classification using wavelet feature have 
higher accuracy than fast fourier transform (FFT). In Pauly research in drowsiness detection used a webcam 
camera that has a low resolution to support detect percentage of eyelid closure (PERCLOS), combining 
histogram of oriented gradients (HOG) and SVM. The final results from the system are compared with 
observations by eyes [20]. In Zhang research, designing drowsiness detection for high-speed trains based on 
drivers using wireless EEG, using FFT and the classification with SVM reaches 90.70% [21]. 

Therefore in this research, we propose an experiment of data retrieval which is designed by using 
modified-EAR and EEG signal. In this research, using the neuroheadset emotiv EPOC to see the behaviour 
EEG signals to EAR's driver. EEG signal will classify by using SVM and wavelet transform as a feature 
extraction for drowsiness detection. The early drowsiness detection system is designed, so Alpha waves, Beta 
waves and Theta waves from EEG signal used in this research (start from the conditions’s someone is still 
allertness, relax condition until the light sleep). 


2. THEORY AND DESIGN SYSTEM DROWSINESS DETECTION 
2.1. Theory 

The following are theories used in the process of drowsiness detection based on EEG signals using 
SVM: EEG, wavelet transform and SVM. 


2.1.1. Electroencephalogram (EEG) 

Electroencephalogram (EEG) is an instrument to capture electrical activity in the brain. EEG signals 
are divided into six groups of brain waves that have different frequency ranges. The six main groups of EEG 
signals, namely [22]: 

- Delta waves (<4 Hz), related to deep sleep. 

- Theta waves (4-8 Hz), appears when someone is in a state of sleepiness and light sleep. 

- Alpha waves (8-13 Hz), appears when someone is in a relaxed state and the strongest energy occurs in the 
frontal and occipital cortex. 

- Mu wave (8-12 Hz), related to motor activity and can be used to recognize the purpose of one's 
movements. 

- Beta waves (13-30 Hz), related to the level of alertness and concentration of a person. 

- Gamma waves (30-50 Hz), are associated with one's mental activities such as creativity and problem 
solving. 


2.1.2. Wavelet transformation 

Wavelet transformation is the development of the fourier transformation so that it has the same way 
of working which is to break the signal into several parts. The difference is that the Fourier transform 
provides the frequency information of the signal but does not provide the time information, whereas the 
wavelet transform provides the time and frequency information of the signal. Wavelet transform is suitable to 
analyze non-stationary signals, different from fourier transform is not suitable for non-stationary signals [23]. 
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Wavelets are a family of functions produced by the basis wavelets called mother wavelets [24]. The 
two main operations that underlie the wavelet are translation and scaling. Translation is a form of 
transformation of the time domain. Scaling is a form of transformation of frequency, with the scale value 
inversely proportional to the frequency value. Mathematically, the basic functions of wavelets are written as 
(1) [24]: 


Vap(t) = v( =?) 


() 


where b=location parameter, a=scaling parameter, t=time. Wavelet transform also consists of several types of 
families, namely Daubechies, Symlets, Coiflets, and others [24]. 


2.1.3. Support vector machine (SVM) 

Support vector machine (SVM) was first introduced by Vapnik as a superior method in the field of 
pattern recognition [25]. SVM is a classifier based on a linear discriminant function, that can be used as a 
binary classifier [26]. The concept of SVM is trying to find the best hyperplane that functions as a separator 
of two classes in the input space. Hyperplane is a horizontal plane that functions as a class separator. The best 
separating hyperplane is obtained by finding the maximum margin. Margin is the distance of the hyperplane 
from the closest pattern commonly called a support vector [27]. There are two pattern (positive objects and 
negative objects) which will separate into two classes, Figure | shows some patterns that are members of two 
classes, namely -1 and +1. The patterns incorporated in class -1 can be formulated as: 


Y — — 
w-x+b=-l / W-xX+b=0 
eB B 
B 
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Figure 1. SVM hyperplane that separates the two classes [27] 


wy-x+b=-1. (2) 
While the patterns incorporated in +1 class can be formulated as (3). 
To separate the two patterns perfectly it takes a hyperplane of dimension d which is formulated as (4). 
w-xt+b=0 (4) 
X; : Input vector 
w : Weight vector 
b : Bias 


Cases that occur in the real world are rarely linearly separable; to overcome this problem the SVM 
was modified by including kernel functions. In non-linear SVM, the data x is first mapped by the function 
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(x) to the higher dimensional vector space [23]. In this new vector space, the hyperplane that separates the 
two classes can only be constructed. This concept can be seen in Figure 2. 
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Figure 2. The idea of kernel trick SVM [27] 


In Figure 2, it is shown that the data in the two dimensions input space cannot be linearly separated, 
and then the function @ maps each data to a higher dimension, so that the two classes can be separated 
linearly. Then the training process is the same as in the linear SVM. The optimization process in this phase 
requires the calculation of the dot product two examples in the new vector space. The dot products of the two 
vectors (xj) and (x;) are denoted as ®(x;). ®(x;). The dot product values of these two vectors can be calculated 
indirectly, that is, without knowing the transformation function ®. This computational technique is called 
Kernel Trick, which calculates the dot product of two vectors in a new vector space by using the components 
of both vectors in the original vector space. Several types of kernels in the SVM: linear kernel, polynnomial 
kernel and gaussian kernel [26]. 


2.2. Design system drowsiness detection 
2.2.1. Data collection scenarios 

Based on the accident traffic data from Korlantas Indonesia in 2018, there are three groups of 
accident victims namely at the age of 15-19 years old, 20-24 years old and 25-29 years old [15]. And also 
from the National Highway Traffic Safety Administration (NHTSA), crashes occurred during the period 
between 1989 and 1993 in the United States, there are approximately 100,000 crashes per year identified with 
drowsiness, the highest road traffic death by age range are 15-29 [1].Therefore in this experiment, the ages 
range of respondent are 19-26 years old, with a total of ten respondent in good health and have sufficient 
sleep. The scenario of drowsiness detection experiment design is summarized in Figure 3: 


Driving-1 Process EAR Calculated 
(2 mimutes) & EEG Recorded 


Driving-2 Process EAR Calculated 
(10 mimutes) & EEG Recorded 


Arithmetic-Stress Process 
(30 mimutes) 


Driving-3 Process EAR Calculated 
(10 mimutes) & EEG Recorded 


Figure 3. Scenario of drowsiness detection experiment 


First stage, the entire respondent did Driving-1 process (duration: 2 minutes), this is the baseline 
condition of respondent. Second stage is continuing to Driving-2 process (duration: 10 minutes). Third stage 
is arithmetic-stress process (duration: 30 minutes), the objective of this process is to make it the respondent 
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feel fatigue. The last stage is Driving-2 process (duration: 10 minutes). Total duration times of experiment 
are 52 minutes. 

Respondent in this research using driver simulator, and the camera is placed in front of the 
respondent's face, so the eye of aspect area (EAR) respondent can be observed and calculated. While driving 
process, respondent also using the neuroheadset emotiv EPOC, to record EEG signals. Emotiv EPOC 14 
channels are devices for recording electrical activity in the brain, and the location of the electrodes can be 
seen in Figure 4 [28]. 


Figure 4. Neuroheadset emotiv EPOC 14 electrode channels and locations 


Emotiv EPOC has several components, namely, a headset, universal USB receiver, and mini-B to A 
USB cable. The headset must be mounted in the right position on the respondent's head and connected to the 
Emotiv PRO software via a universal USB receiver. The results of this experiment are then saved. The 
Driving-1 process is considered a baseline condition, is the respondent's condition is considered awake. The 
drowsiness condition of the driver was observed based on the Eye Aspect ratio (EAR) and the respondent's 
EEG signal behavior from Driving-2 process and Driving-3 process. EEG signal recording is done 
simultaneously with EAR calculated. Block diagram of the design of drowsiness detection systems using 
SVM can be seen in Figure 5, which consists of several processes, namely preprocessing, feature extraction, 


and classification. 
[ : i Feature Extraction / 
Lx (Wavelet Transform) j 


i 7 ‘ Feature Extraction : 
Lx (Wavelet Transform) 


Figure 5. Drowsiness detection using wavelet transform and SVM system block diagram 


Classification Drowsiness 
(SVM) / Awake 


2.2.2. Preprocessing 

In this research, EEG signals were recorded using 14 channel Emotiv EPOC Neuroheadset, using 
frontal channels (F3, F4, F7, F8) and occipital (O1 and O2). The signals used in this research are alpha, high 
beta, low beta and theta. Display of EEG signal recording in Emotiv PRO software can be seen in Figure 6. 
At the preprocessing stage, the data of EEG signals separated to training data and testing data. For training 
data, EEG signals are grouped into drowsiness signals and non-drowsiness signals based on EAR. Awake 
condition signal was taken from the Driving-1 process, then the drowsiness condition signal was taken from 
Driving-2 process and Drive-3 process. If the EAR is below the threshold for 1.5 seconds or 38 consecutive 
frames, then the respondent is considered drowsy [6]. Then the EEG signal is segmented each 2 minutes, so 
from Driving-2 process and Driving-3 process there are five segmented data per 2 minutes (from 10 minutes 
EEG signal). 
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Figure 6. Display of EEG signal recording in emotiv PRO software 


2.2.3. Feature extraction 

After the signal pre-processing stage process, the next step is feature extraction. Feature extraction 
used in this research is wavelet transform. The wavelet type used is Daubechies (Db4) at level 4. Level 4 was 
chosen because at this level the signal decomposition results most closely resemble the original signal. 
Figure 7(a) is the raw data signal EEG F3 Alpha, then after the wavelet transform process can be seen in 
Figure 7(b). Wavelet coefficients from each channel EEG signal used as feature extraction. 


Signal Approximation at level 4 (reconstructed) 


Figure 7. EEG signal of F3 alpha; (a) raw data, (b) after wavelet transform 


2.2.4. Classification 

In this research the classification method used is SVM. The validation technique used is k-fold cross 
validation with k=5, chosen because for each driving process have 5 set data. In K-fold cross-validation, 
dataset X is randomly divided into K parts of equal magnitude X_i, i = 1,2, ... K. To produce each pair of 
data is to store one of the K parts into a validation set, then combine with parts the remaining K-1 is to form 
the training set. Done K times (Ethem): 


VA = X, Ty =X, U X3 U UX, 

V, = Xz, Tz = X, UX3 UU X 
2 2 2 ‘ 3 k (5) 

Vi = Xk, Tx = XxX, U X2 Ue UXx_4 

In training process, several kernel classifier SVM testing to choosen the highest accuration. Then 


from the best kernel chosen, continued to training process. Data classify into two classes, namely | 
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(drowsiness class) and -1 (not drowsiness/ awake class). After the SVM testing process, contains a class 
prediction for each data consisting of 66 outputs is the result of classification consisting of 1 (drowsiness 
class) and -1 (not drowsiness/ awake class). In this research, indication of drowsiness respondent is 
determined if at least 2/3 of the 66 outputs (there are 44 output) stated "1" (drowsiness class), 2/3 (66%) 
chosen because this decision greater than 50% SVM output in order to determine a clear classification for 
each class. 


3. RESULTS AND DISCUSSION 

Wavelet coefficients used in this research were 66 coefficients used as input to the SVM in the 
training and testing process. In the classification training process using SVM, the validation technique used is 
5-fold cross validation. From various kernels tested, it was found that the highest level of accuracy in the 
SVM training process using a quadratic kernel reached 84.5%.Thus the kernel used is the Quadratic kernel, 
see Table 1. 


Table 1. Kernel accuration 


Kernel Accuration 
Linear 71.8% 
Quadratic 84.5% 
Cubic 52.3% 

Fine Gaussian 72.2% 
Medium Gaussian 70.2% 
Coarse Gaussian 59.6% 


After the training process then proceed to the testing process, which is carried out on Driving-2 
process and Driving-3 process. In the Driving-2 process the results of the SVM classification of 10 
respondents can be seen in Figure 8(a). If the total SVM output is greater than 2/3 from output SVM, then the 
class is declared as drowsiness (1), else the class is declared as awake (-1), performed on all five data for 
each respondent in the Driving-2 Process, see Figure 8(b). For example for respondent R1, respondent was 
detected drowsiness in the Data2-2, Data2-3 and Data2-4. Whereas in the Data2-1 and Data2-5 detected in 
the awake conditions (no drowsiness). So, totally in 10 minutes process, the driver is majority drowsy (R1, 
R2, R5, R6, R7, RY, and R10). And there 3 respondents were detected as awake class (R3, R4, and R8). 


SVM Classification from Driving-2 Process Final Classification from Driving-2 Process 
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Figure 8. SVM classification and final classification from Driving-2 process; (a) SVM classification, 
(b) final classification 


In the Drive-3 process the results of the SVM classification of 10 respondents can be seen in 
Figure 9(a). Then the final classification, see Figure 9(b). The final classification results can be seen that the 
results of the majority drowsy are respondents R1, R2, R5, R6, R7, and R9, and the respondent identified as 
not drowsy/ awake are respondents R3 , R4, R8 and R10. 
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Figure 9. SVM classification and final classification from Driving-3 Process; (a) SVM classification, 
(b) final classification 


4. CONCLUSION 

In this research has successfully used EEG signals based on modified EAR for drowsiness detection 
by using wavelet transform as a feature extraction and SVM as a classifier. With 5-fold cross validation, in 
this research quadratic kernel (84.5%) produces the best level of accuracy compared to the type of linear 
kernel (72.2%), cubic (52.3%), fine gaussian (72.2%), medium gaussian (70.2%), and coarse gaussian 
(59.6%). In testing Driving-2 process data results,7 respondents were detected as drowsiness class (R1, R2, 
R5, R6, R7, R9, and R10), and 3 respondents were detected as awake class (R3, R4, and R8). In Driving-3 
process there 6 respondents were detected as drowsiness class (RI, R2, R5, R6, R7, and R9), and 4 
respondents were detected as awake class (R3, R4, R8 and R10). Because in this experiment drowsiness 
detection was observed in a certain time period, so for the next research to develop drowsiness detection with 
a short time in real time, by utilizing several other combination methods. 
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